Optimizing Classifier Performance via an Approximation to the Wilcoxon-Mann-Whitney Statistic
نویسندگان
چکیده
When the goal is to achieve the best correct classification rate, cross entropy and mean squared error are typical cost functions used to optimize classifier performance. However, for many real-world classification problems, the ROC curve is a more meaningful performance measure. We demonstrate that minimizing cross entropy or mean squared error does not necessarily maximize the area under the ROC curve (AUC). We then consider alternative objective functions for training a classifier to maximize the AUC directly. We propose an objective function that is an approximation to the Wilcoxon-Mann-Whitney statistic, which is equivalent to the AUC. The proposed objective function is differentiable, so gradient-based methods can be used to train the classifier. We apply the new objective function to real-world customer behavior prediction problems for a wireless service provider and a cable service provider, and achieve reliable improvements in the ROC curve.
منابع مشابه
Optimizing Classi er Performance Via the Wilcoxon-Mann-Whitney Statistic
Cross entropy and mean squared error are typical cost functions used to optimize classi er performance. The goal of the optimization is usually to achieve the best correct classi cation rate. However, for many two-class real-world problems, the ROC curve is a more meaningful performance measure. We demonstrate that minimizing cross entropy or mean squared error does not necessarily maximize the...
متن کاملGP Classification under Imbalanced Data sets: Active Sub-sampling and AUC Approximation
The problem of evolving binary classification models under increasingly unbalanced data sets is approached by proposing a strategy consisting of two components: Sub-sampling and ‘robust’ fitness function design. In particular, recent work in the wider machine learning literature has recognized that maintaining the original distribution of exemplars during training is often not appropriate for d...
متن کاملA data-adaptive methodology for finding an optimal weighted generalized Mann-Whitney-Wilcoxon statistic
Xie and Priebe [2002. “Generalizing the Mann–Whitney–Wilcoxon Statistic”. J. Nonparametric Statist. 12, 661–682] introduced the class of weighted generalized Mann–Whitney–Wilcoxon (WGMWW) statistics which contained as special cases the classical Mann–Whitney test statistic and many other nonparametric distribution-free test statistics commonly used for the two-sample testing problem. The two-sa...
متن کاملA uniform saddlepoint expansion for the null-distribution of the Wilcoxon- Mann-Whitney statistic
The authors give the exact coefficient of 1/N in a saddlepoint approximation to the Wilcoxon-Mann-Whitney null-distribution. This saddlepoint approximation is obtained from an Edgeworth approximation to the exponentially tilted distribution. Moreover, the rate of convergence of the relative error is uniformly of order O(1/N) in a large deviation interval as defined in Feller (1971). The propose...
متن کاملAn optimal Wilcoxon-Mann-Whitney test of mortality and a continuous outcome.
We consider a two-group randomized clinical trial, where mortality affects the assessment of a follow-up continuous outcome. Using the worst-rank composite endpoint, we develop a weighted Wilcoxon-Mann-Whitney test statistic to analyze the data. We determine the optimal weights for the Wilcoxon-Mann-Whitney test statistic that maximize its power. We derive a formula for its power and demonstrat...
متن کامل